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Abstract 


Caused by a new coronavirus, severe acute respiratory syndrome (SARS) is a highly contagious disease associated with significant 
fatality that emerged in 2003. The molecular cause of the unusually high human pathogenicity of the SARS coronavirus (SARS- 
CoV) is still unknown. In an effort to characterize molecular components of the virus that are absent in other coronaviruses, all of which 
are considerably less pathogenic for humans, we recombinantly produced the SARS-unique domain (SUD) within non-structural protein 
3 (Nsp3) of SARS-CoV and characterized its nucleic-acid binding properties. Zone-interference gel electrophoresis and electrophoretic 
mobility shift assays revealed a specific affinity of SUD for oligo(G)-strings. A few such segments are present in the SARS-CoV genome, 
but also in mRNAs of host proteins involved in the regulation of signaling pathways. A putative role of SUD in virus-induced apoptosis 


or survival of host cells is discussed. 
© 2007 Elsevier Inc. All rights reserved. 
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Severe acute respiratory syndrome (SARS) is a highly 
infectious disease that first surfaced in 2002/2003 and was 
associated with a fatality rate of about 10% [1]. A new 
coronavirus, SARS-CoV, was identified as the causative 
agent [1-3]. SARS-CoV is an enveloped virus with a large 
(29.7 kb) single-stranded RNA genome of positive sense 
that encodes several structural and auxiliary proteins as 
well as two large overlapping polyproteins, ppla 
(~486 kDa) and pplab (~790 kDa) [4]. These polyproteins 
are processed by two proteases to yield 16 non-structural 
proteins (Nsp1—Nsp16), which are essential for RNA repli- 
cation and processing. 

Nsp3 is the largest among the non-structural proteins of 
SARS-CoV. It is believed to comprise at least six domains 
(Fig. 1A): (1) an N-terminal acidic domain which is rich in 
glutamate; (2) the *X domain’ which binds poly(ADP- 
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ribose) and has a fold similar to the macrodomain of his- 
tone 2A [5,6]; (3) the SARS-unique domain (SUD) which 
has only been found in SARS-CoV so far [4,7]; (4) a 
papain-like protease (PL2?"°), which also exhibits deubiq- 
uitinating activity and is involved in suppression of the 
type-I interferon response of the host cell [8-12]; (5) an 
additional, non-canonical papain-like domain identified 
by bioinformatics (Ferron and Canard, personal communi- 
cation); and (6) the ‘Y domain’, which contains an N-ter- 
minal transmembrane subdomain [13]. Except for the 
PL2°"°, the functions of these domains are unclear. Some 
may play a role as part of the replicase/transcriptase com- 
plex of the virus, and others may interact with components 
of the host cell. A recent study of the interactome of the 
proteins of the SARS coronavirus indicated some interac- 
tion between an N-terminal fragment of Nsp3 with Nsp2 
and with Orf3a. However, no results were obtained for 
the SARS-unique domain (SUD) because for cloning rea- 
sons, it was cleaved into two halves [14]. 
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Fig. 1. (A) Organization of SARS-CoV genome. The location of different Nsps in ORF 1a/1lab and their function is indicated. (B) Expression of full-length 
SUD (f-l) and SUDcSb (c5b). SDS-PAGE (15%, lanes 1 and 2) and Western blot using anti-(His)4-antibody (lanes 3 and 4) and anti-Nsp3-antibody (lanes 


5 and 6). 


We believe that the SUD might be of particular interest 
for elucidating the basis for the high pathogenicity of 
SARS-CoV for humans. All other coronaviruses, including 
the recently discovered human coronaviruses NL63 [15] 
and HKU1 [16], do not have this domain, and since they 
are much less pathogenic for humans, it is conceivable that 
there is a correlation between the presence of the SUD and 
viral pathogenicity. We therefore decided to produce this 
domain using recombinant DNA technology and in-vitro 
protein synthesis in order to study its interactions with 
components of the virus and the host cell. In this commu- 
nication, we show for the first time that full-length SUD as 
well as a truncated version bind oligo(G) stretches (“G- 
strings’) in nucleic acids. 


Materials and methods 


In-vitro protein synthesis. For expression of the SUD, a plasmid 
encoding the full-length domain with an N-terminal His-tag and a factor 
Xa recognition sequence was constructed by transfer of the NcoI-Xhol 
fragment from pETBlue-2-SUD (see below) into the dephosphorylated 
pIVEX1.3-WG vector (Roche) cleaved with the same restriction enzymes. 
After expression using the RTS wheat-germ continuous-flow cell-free 
system (Roche), the His-tagged protein was purified using immobilized 
metal-affinity chromatography (IMAC; HisTrap HP column), followed by 
elimination of imidazole through a HiTrap desalting column (GE 
HealthCare) equilibrated with 50 mM Tris-HCl, 300 mM NaCl, pH 7.5. 
Protein purity was estimated by 15% SDS-—polyacrylamide gel electro- 
phoresis (PAGE) under denaturing conditions. After transfer to a nitro- 
cellulose membrane (Schleicher & Schuell), the protein was 
immunologically detected by anti-His (Novagen) or anti-Nsp3 (Rockland) 
antibodies. 

Expression of full-length SUD in Escherichia coli. For the production of 
the full-length SUD in E. coli, the coding region corresponding to residues 
349-726 of Nsp3 was PCR-amplified using sense (GGATTATCTTGATA 
ACCTGAAGC) and anti-sense (GTCTTAACCTCCCGCAGGGATAA) 


primers and SARS-CoV cDNA (TOR2 strain; Accession No. AY274119; 
kind gift of Dr. J. Ziebuhr) as template. The PCR product was cloned into 
the EcoRV site of the plasmid pETBlue-2 (Novagen), resulting in a gene 
encoding a protein with three additional amino acids (M-A-M) at the N- 
terminus and a C-terminal extension of 56 residues including a C-terminal 
(His), tag. The fusion gene was expressed in E. coli Tuner DE3 (pLaclI) 
cells (Novagen). 

Expression of the SUDc5b fragment. Step-wise N- and C-terminal 
truncations led to a stable minimal version of the SUD protein consisting 
of residues 389-562 of Nsp3, named SUDcSb. The coding region for this 
protein was PCR-amplified from the plasmid encoding the full-length 
protein and ligated into the Stul site of the plasmid pQE30, providing an 
N-terminal His-tag and a short linker sequence. E. coli M15 (pRep4) was 
used as expression host for this construct. The coding regions of all 
expression plasmids were verified by DNA sequencing. 

Purification of SUDc5Sb. The cells were resuspended in lysis buffer 
(10 mM imidazole, 300 mM NaCl, 25mM NaH>PO,, pH 8.0) and dis- 
rupted with a sonicator. After centrifugation at 16,000g (20 min, 4 °C), 
Benzonase was added to the supernatant, followed by ultracentrifugation 
at 38,000g (1 h, 4°C). SUDcS5b was purified by IMAC and desalted as 
described above for the full-length protein. Further purification was 
achieved by ion-exchange chromatography using a gradient from 25 mM 
to 1M NaCl. In all experiments, protein concentration was determined 
using the BCA Protein Assay Kit (Pierce) or by absorbance at 280 nm. 
The protein was stored at 4 °C. 

Analytical size-exclusion chromatography. The apparent molecular 
mass of SUDcSb was determined by an analytical BioSep-SEC-S 3000 
column (Phenomenex) connected to an HPLC DuoFlow (Bio-Rad). The 
column was calibrated in 25mM Tris-HCl, 300mM NaCl, 0.5mM 
EDTA, pH 7.5, using the following standard proteins: cytochrome c 
(12.5 kDa), chymotrypsinogen (25.0 kDa), ovalbumin (42.75 kDa), bovine 
serum albumin (66.4 kDa), and glucose oxidase (160 kDa). The sample 
volume was 25 pl with a concentration of 2 mg/ml. 

Dynamic light-scattering. Dynamic light-scattering (DLS) was per- 
formed using a Laser-Spectroscatter 201 (RiNA GmbH). After centrifu- 
gation for 1 h at 4°C and 13,000g, sample (20 il) was applied to a 10-mm 
quartz cuvette (Hellma). 

Electrophoretic mobility shift analysis (EMSA). Radioactive RNA 
probes bbc3-Gj4 and bbc3-A\4 (see Results and discussion), respectively, 
were transcribed by T7 RNA polymerase in the presence of [a-*°P]-GTP 
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(3000 Ci/mmol, Hartmann) from the following deoxyoligonucleotide 
duplexes containing the T7 promoter (underlined, only the sense strand 5’- 
to-3’ direction is indicated): TGTAATACGACTCACTATAGGGTGA 
CACTGGGGGGGGGGGGGGCTCTCCTCTCGGTGCTCCTTCAC 
TC and TGTAATACGACTCACTATAGGGTGACACTAAAAAAA 
AAAAAAACTCTCCTCTCGGTGCTCCTTCACTC. EMSA was per- 
formed with radioactive RNA probes as described [17] with the following 
modifications: the reaction mixture containing **P-labeled RNA (1— 
3x 10+ cpm) and 1-5 pg of the protein tested in binding buffer (5 mM 
Hepes, pH 7.9, 25mM KCl, 2mM MgCh, 1.75mM ATP, 6mM DTT, 
0.05 mM phenylmethylsulfonyl fluoride, 166 pg/ml of E. coli tRNA, and 
5% glycerol) was incubated for 20 min at 30 °C. For electrophoresis, non- 
denaturing polyacrylamide 4-12% gradient TBE gels (Invitrogen) were 
used. RNA-protein complexes were visualized using a PhosphorImager 
(Fujifilm BAS 1000) and the software PCBAS (Raytest). 

Zone-interference gel electrophoresis. The zone-interference gel elec- 
trophoresis (ZIGE) device was adapted from [18] (see Fig. S1 in Supple- 
mentary Material). The agarose was dissolved in 1x TBE buffer. The 
protein was incubated at RT for 0.5 h with deoxyoligonucleotides of dif- 
ferent lengths (6-14 and 24 in case of bbc3), with concentrations of 0- 
32 uM in a sample volume of 10 ul. The samples were mixed with 
dimethylsulfoxide (DMSO; final concentration of 10% (v/v)) and a trace 
of bromophenolblue (BPB). These protein-oligonucleotide samples were 
applied to the small slots (see Fig. S1). Oligonucleotide with the same 
concentration as in the small slots was also mixed with DMSO and BPB in 
1x TBE buffer and applied to the long slots of the gel (total volume 
100 pl). Electrophoresis was performed at 4°C for 1h with a constant 
current of 100 mA. Staining was performed as outlined in [18]. 


Results and discussion 


The DNA sequence encoding the full-length SUD (resi- 
dues 349-726 of Nsp3; see Fig. 1A), also referred to as 
Nsp3c, was successfully expressed with an N-terminal 
His-tag in E. coli (not shown) and in the in-vitro RTS 
wheat-germ system and purified to homogeneity (Fig. 1B, 
lane 1). Besides the full-length SUD, a soluble, more stable 
minimal version, SUDc5b, was constructed by systematic 
N- and C-terminal truncation and produced, with an N- 
terminal His-tag, in E. coli (Fig. 1B, lane 2). The identity 
of the proteins was verified by N-terminal sequencing 
(not shown) and reaction with anti-(His)4 and anti-Nsp3 
(SARS-CoV) (Fig. 1B, lanes 3-6). 

Independent of the expression system used, the full- 
length SUD displayed a strong and reproducible tendency 
to proteolytic (self-)degradation. This was reduced but still 
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detectable in the preparations of the SUDcSb protein. 
Analytical size-exclusion chromatography and dynamic 
light-scattering measurements indicated that SUDc5b is a 
monomeric protein (31 kDa) (see Figs. S2 and S3A in 
Supplementary Material). 

We were able to show that both full-length SUD and its 
truncated version, SUDc5b, bind single-stranded DNA 
and RNA. For short nucleotides (decamers), this interac- 
tion is highly specific for oligo-guanidine nucleotides. Both 
deoxyribo- and ribo-Gj, are bound with comparable affin- 
ities (Fig. 2A and B). Kg values derived from zone-interfer- 
ence gel electrophoresis (ZIGE; [18]) are in the low 
micromolar range (in case of SUDcS5b, 1.0 uM for (dG) 
and 0.93 uM for Gio). The binding affinity is dependent 
on the length of the oligo(G) nucleotide. For SUDc5b, 
the highest affinity was found for (dG),2_14 (not shown). 
For full-length SUD, the Kg value derived from ZIGE is 
about 2.1 uM (Fig. 2C). 

A database and literature search for potential oligo(G) 
stretches that might be recognized by SUD within the host 
cell revealed interesting potential targets. Since SUD does 
not seem to differentiate between DNA and RNA, the 
availability of potential binding sites will depend on the 
sub-cellular localization of the protein. Oligo(dG) 
sequences, so called G-strings, have been identified as bind- 
ing sites for transcription factors in eukaryotic cells [19,20]. 
Since there is no evidence for a nuclear localization of Nsp3 
in the host cell [8,14], we focused on potential cytosolic 
interaction partners. Oligo(G) sequences are found in 3’- 
UTRs of several mRNAs including the mRNA for the 
human pro-apoptotic protein Bbc3 (Accession No. 
U82987; [21]), the transcript variant 1 of the mRNA for 
the human MAP kinase 1 (NM_002745), the mRNA for 
the human RAB6B protein, a member of the Ras oncogene 
family (NM_016577), and the mRNA for TAB3, a compo- 
nent of the NF-«B signaling pathway (NM_152787). These 
proteins are prime candidates for an interference of the 
virus with cellular signaling pathways. Changes in the sta- 
bility and/or translation efficiency of these mRNAs due to 
the binding of a regulatory factor should result in an 
altered response of the infected cell to extracellular signals. 
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Fig. 2. Zone-interference gel electrophoresis experiments (ZIGE) performed to evaluate the affinity of SUD for different nucleic-acid species. (A) SUDc5b 
in presence of increasing concentrations (1, 2, 4, 8, 16, and 32 4M, lanes 2-7) of (dG)jo. Lanes labeled c5b, A, C, and T correspond to nucleic acid-free 
protein, mixture of SUDc5b with 16 uM of (dA)j10, (dC)10, and (dT)10, respectively. (B) SUDc5b in presence of increasing concentrations (1, 2, 4, 8, 16, and 
32 uM, lanes 2-7) of ribo-Gyjo. (C) Full-length SUD (f-1) in presence of increasing concentrations (1, 2, 4, 8, 16, and 32 uM, lanes 2-7) of (dG). 
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The truncated SUDc5b binds efficiently (Kg = 0.45 UM) 
to an oligodeoxynucleotide (named bbc3-(dG),4) of 24 
bases in length, with a central (dG),4 stretch, which we 
used as a model for the bbc3 mRNA. The replacement of 
the (dG) 4 stretch by (dA),4 leads to a dramatically reduced 
binding affinity (Fig. 3). Similar results were obtained for 
full-length SUD (not shown). 

To further assess the RNA-binding affinity of SUD and 
SUDc5b, a radioactive RNA probe (named bbc3-G,,4) con- 
taining 14 G bases and flanking regions (a total of 48 bases) 
from the bbc3 gene (nucleotides 983-1030) was transcribed 
by T7 RNA polymerase from a synthetic deoxynucleotide 
duplex template harbouring the T7 promoter [22]. As a 
control for the specificity of the RNA —protein interaction, 
an RNA probe (bbc3-A,4) containing an A,,4-string instead 
of Gj4 was prepared in a similar manner. An electropho- 
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Fig. 3. ZIGE results illustrating the complex formation of SUDc5b with 
bbc3-(dG),4. (A) SUDc5b in presence of increasing concentrations (1, 2, 4, 
8, 16, and 32 uM, lanes 2-7) of bbc3-(dG)\4. Lanes labeled c5b and A 
correspond to nucleic acid-free protein and mixture of SUDc5b with 
32 uM of bbc3-(dA),4 (lane 8), respectively. The sequence of bhc3-(dG) 4 is 
5’-ACA CTG GGG GGG GGG GGG GCT CTC-3’, and the sequence of 
bbc3-(dA)14 is 5'-ACA CTA AAA AAA AAA AAA ACT CTC-3’. (B) 
The Kg value is derived from the slope of the graph [18]. 


retic mobility shift assay (EMSA) was used to probe the 
interaction between SUD proteins and RNAs. To prevent 
non-specific binding of radioactive RNA probes to the pro- 
teins, a 1000-fold molar excess of unlabeled tRNA was 
added to the reaction mixtures. As can be seen from 
Fig. 4A, full-length SUD is able to retard the mobility of 
the radioactive bbc3-G,4 RNA in a concentration-depen- 
dent manner (lanes 1-4). An anti-Nsp3 antibody that had 
been tested for specific interaction with SUD (see 
Fig. 1B, lanes 5 and 6), but not an irrelevant anti-T7 
RNA polymerase antibody, supershifts the migration of 
the SUD-RNA complex (Fig. 4A, lanes 7 and 8, respec- 
tively), implying direct and highly specific interaction 
between SUD and bbc3-Gj4. Both full-length SUD 
(Fig. 4A, lanes 2-4 and 6) and SUDc5b (Fig. 4B, lane 1) 
are able to retard the electrophoretic mobility of free 
bbc3-G,4 RNA due to RNA-protein complex formation. 
In contrast, bbc3-A,;4 RNA is unable to form a similar 
complex either with full-length SUD (Fig. 4B, lane 4) or 
with SUDc5b (Fig. 4B, lane 5), thus confirming the speci- 
ficity of the RNA-—protein interaction. Interestingly, the 
appearance of an additional, extremely slow-migrating 
band was observed with SUDcSb and bbc3-G,;4 RNA 
(Fig. 4B, lane 1, asterisk), but not with bbc3-A,4 RNA 
(lane 5). This is possibly due to a multimerization of a spe- 
cific RNA-protein complex. The ability of SUDc5b to 
form multimers in presence of (dG)g was confirmed by 
dynamic light-scattering (Fig. S3B). No interaction of 
bovine serum albumin (BSA), included as a further control 
of specificity, with bbc3-G,4 RNA (Fig. 4B, lane 2) or with 
bbc3-A,4 RNA (Fig. 4B, lane 6) was detectable. 

The observed affinity of SUD to the G-string of bbc3 
RNA (bbc3-G,4) is highly specific, since a 1000-fold molar 
excess of tRNA is not able to prevent the formation of the 
complex. Moreover, the exchange of G14 to Aj4 completely 
prevents the interaction. 

Other possible interaction partners for SUD are the cor- 
onaviral RNAs within the infected cells. We found three 
potential binding sites in the genome of the TOR2 isolate 
of the SARS coronavirus. A single Ge motif (nucleotides 
21,604—21,609) and two C¢-motives (nucleotides 17,386— 
17,391 and 28,157—28,162) are present in the (+)-strand 
(Fig. S4A). SUDcSb also binds to G¢-containing deoxyri- 
bonucleotides derived from these regions of the viral gen- 
ome, again with an estimated affinity of approximately 
1 uM (Fig. S4B and C). The binding affinity to the reversed 
complementary sequences representing the opposite strand 
of the genome is considerably lower (Fig. S4). However, in 
accordance with the preferred length of the oligo(G) stretch 
of 12-14 nucleotides, the highest binding affinity was found 
for the oligonucleotides derived from the human bbc3 
mRNA. 

As mentioned before, G;,_;5 motifs are present in the 3’- 
UTRs of human mRNAs encoding Bbc3, MAP kinase 1, 
RAB6B, and TAB3 proteins, all of which are components 
of cellular signaling pathways. It is conceivable that bind- 
ing of SUD to these RNAs will result in an alteration of 
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Fig. 4. (A) Concentration-dependent binding of full-length SUD to [**P labeled (1.5 x 10° cpm) bbc3-G,4 RNA representing a segment of bbc3 mRNA, 
and demonstration of specificity of interaction by the formation of a supershift. Indicated amounts of the full-length SUD (lanes 1-4) were incubated in 
binding mixtures before electrophoresis. In lanes 7 and 8, full-length SUD (6.4 1M) was incubated either with anti-Nsp3 or with anti-T7 RNA polymerase 
antibodies. (B) Specificity of interaction to bbc3-G,4 RNA. Full-length SUD (lane 4), its truncated form c5b (lanes 1 and 5), or BSA (lanes 3 and 6) were 
incubated with radioactive bbc3-G;4 RNA (lanes | and 2) or with control RNA, bbc3-Aj4 (lanes 3-6). Retarded and supershifted RNA—protein complexes 
are shown with arrowheads. Free RNAs are shown with arrows. Unspecific retardation of radioactivity, detected also in lanes with free RNAs and BSA, is 


marked with the bracket. 


their stability, or translation efficiency, as it has been dem- 
onstrated for eukaryotic trans-acting factors such as HuR, 
CP1, and TIA-1 [23-26], and thereby will interfere with cel- 
lular signal transduction. Bbc3 is a pro-apoptotic protein 
that is induced by a variety of apoptotic stimuli. It is sub- 
ject to transcriptional regulation by multiple cell death-sig- 
naling pathways [21] and has been demonstrated to 
mediate apoptosis in neuroblastoma cells [27]. Blocking 
Bbc3 production at the translation level could override 
the transcriptional activation and render a cell less sensitive 
to apoptotic signals. Similarly, an interaction of SUD with 
the TAB3 mRNA may contribute to the recently described 
NF-«B activation and the increased CXCL10 (interferon y- 
inducible protein 10) levels observed in the blood of SARS 
patients [28]. In addition to the spike, nucleocapsid, Orf3b, 
Orf6, Orf7a, and PL2°"° proteins [12,28], SUD could thus 
be another component of SARS-CoV involved in the mod- 
ulation of host-cell signaling pathways. 
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